Libraries tagged by file extraction
causal/extractor
135133 Downloads
This extension detects and extracts metadata (EXIF / IPTC / XMP / ...) from potentially thousand different file types (such as MS Word/Powerpoint/Excel documents, PDF and images) and bring them automatically and natively to TYPO3 when uploading assets. Works with built-in PHP functions but takes advantage of Apache Tika and other external tools for enhanced metadata extraction.
wapmorgan/cab-archive
9375 Downloads
Reading and extraction of .cab-files
timber/wp-i18n-twig
732 Downloads
WordPress translations extraction for Twig files with WP-CLI
centertap/tika-all-the-files
67 Downloads
Mediawiki extension that provides extraction of searchable text and metadata from uploaded files, via Apache Tika
aspose/pdf
34 Downloads
A powerful library for manipulating and converting PDF files.
xatham/text-extraction
11 Downloads
Easy text extraction for many different file types
sergiodanilojr/zipper
48 Downloads
Zipper is a facilitator for creating Zip Files in an uncomplicated way, with features of Download, Insert Files, Extraction and Creation of Zip with multiple in one!
matejch/html_helpers
5 Downloads
Helper class for removing elements and content, and extracting file paths
fgsl/csvextractor
2 Downloads
component to extract data from CSV files to SQL database tables
fabiomez/data-extractor
4 Downloads
Library for data extraction from common resources like string or a CSV row from files
jeevi/cabinet
11 Downloads
Microsoft Cabinet file extraction wrapper. Uses either cabextract or expand
ibracilinks/ziparchive
133 Downloads
PHP zip utility for file compression, extraction and backup
cleentfaar/pharly
15 Downloads
A PHP library for the archiving and extraction of files and directories in .zip, .tar, .tar.gz, and .tar.bz2 formats.
joest8/pdfinterpreter
1 Downloads
This class is designed to convert multiple PDF files, whether image-based or text-based, into an array of data.The class uses user-defined templates containing regular expressions to control the data extraction process, allowing for customized and flexible output.
label305/docx-extractor
20045 Downloads
PHP library for extracting and replacing string data in .docx files.